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270 1 TTATTATCCAGAACATCTA GTTAATCATTACTT CCAAACTAGA CACTATTTACAC ACTCT 

HNF1 HNF3 

276 1 ATGG AAGGCGGGTA TA TTATATAA GAGAGAAACAACACATAGCGCCTCATTTTGTGGGTC 

Spl TBP RNA Start 

282 1 ACCATATTCTTGGGAACAAGATCTACAGCATGGGGC 

PreS 1 protein start 



Fig. IB 
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•96 — GGTATATTATATAAGAGAG/TBP-wt 
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CTATCGCCGACGGCAGTCC/HNF1 -m 
CTAGTTAATAATTAATTCC/HNF1 -21X 
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Fig, 2 



1081 CTA AGC AGG CTT TCA CTT T CT CGC CAA CTT ACA AGG CCT TTC TGT GTA AAC AAT 

NF1 (1100-1119) 

2c (1119-1134) 

1135 ACC TGA ACC TTT A CC CCG TTG CCC GGC AAC GGC CA G GTC TGT GCC AAG TGT TTG 

EF-C (1148-1168) 



1189 CTG ACG CAA CCC CCA CTG GCT GGG GCT TGG TCA TGG GCC ATC AGC GCA TGC GTG 

E (1180-12 02) NF1 (12 09-123 6) X-PBP (1229-12 45) 

12 4 3 GAA CCT TTT CGG CTC CTC TGC CGA TCC ATA CTG CGG AAC TCC TAG CCG CTT GTT 



12 97 TTG CTC GCA GCA GGT CTG GAG CAA ACA TTA TCG GGA CTG ATA ACT CTG TTG TCC 



1351 TAT CCC GCA AAT ATA CAT CGT TTC CAT GGC TGC TAG 138 6 
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CAGCTGGG CCGCCCTTGT 
GCGGGGGCCA ACGCGATTGT GGGTGCTCGG 
CCCTGCTGGG GCAACCCATC GCTCCCCATG 
CCCGGAATAT TAGTAATCCT AATTCCCGGC 
GAAAGGTGGG GGTGGGGGGG GTCGCATCTT 
CTTTTTCTAT CAGTTTTCTT TGAGCTTTTA 
CTGAACTATA TTCAAAAGGA AGTAAATGAA 
AAAATAAAAA CAAAAGTTAA GACAGTAAAA 
ACAGAACCTG TAATTTTAAA AACTGTGTAT 
ATATTGGGGA CCCTCTCATG TAACCACGAA 
GTACACTCGT TTGTTTAATT GATAATTGTT 
CACGCTCACG AATTCAGTCC CAGGGCAAAT 
ACAAAACCAA TTAGGAACTT CGGTGGTCTT 
CAATTTAATT TCTTTTTTAA TTAAAAAAAA 
CTTTCCATTC AGAGGTGTGT TTCTCCCGGT 
CAGTTGGGGA CCCCCGCAAG GACCGACTGG 
CAGGCTAGAA GGACAAGATG AAGGAAATGC 
TTCGGGCATT TATTTTATTT TATTTTTTGA 
ACTTTTAGGG TTACCCCCTT GGGCATTTGC 
TGCACAGGGG TTGTGTGCCC GGTCCTCCCC 
TTACACGTGT TAATGAAAAT GAAAGAAGAT 
CGCCCGTGGG TGCCCTCGTG GCGTTCTTGG 
GGGTGTCGCC GCGCCCCAGT CACCCCTTCT 
GCCTTCCTAG TTGTCCCCTA CTGCAGAGCC 
ACCCACTCGA GGCGGACGGG GCCCCCTGCA 
AGCGGGGCGA TTTGCATTTC TATGAAAACC 
GGCGCGGCGC CTCAGGGATG GCTTTTGGGC 
CCCGCGCCCC CTCCCCCTGC GCCCGCCCCC 
CTTTGATCTT TGCTTAACAA CAGTAACGTC 
TGCAAAGTCC TGGAGCCTCC AGAGGGCTGT 
CACGCTCCGG CGAGGGGCAG AAGAGCGCGA 
AGCGCGGACC CAGCCAGGAC CCACAGCCCT 



GCGCGGGCTG ATGCTCTGAG GCTTGGCTAT 
GGAGTGGGGG GGGGCACGAC CGTAGGTGCT 
CGGAATCCGG GGGTAATTAC CCCCCCAGGA 
GGGGGAGGGG GCGCGGGAGG AATTCACCCT 
GCTGTGAGCA CCCTGGCGAA GGGGAGAGGG 
CTGTTAAGAG GGTACGGTGG TTTGAT GACA 
CAGTTTTCTT AATTTGGGGC AGGTACTGTA 
TGTCCTTTTA TTTTTTAATG CACCAAAGAG 
TTTAATTTAC ATCTGCTTAA GTTTGCGATA 
CACCTATCGA TTTTGCTAAA AATCAGATCA 
CTGAATTATG CCGGCTCCTG CCAGCCCCCT 
TCTAAAGGTG AAGGGACGTC TACACCCCCA 
GTCCCAGGCA GAGGGGACTA ATATTTCCAG 
TGAGTCAGAA TGGAGATCAC TGTTTCTCAG 
TAAATTGCCG GCACGGGAAG GGAGGGGGTG 
TCAAGGTAGG AAGGCAGCCC GAAGAGTCTC 
TGGCCACCAT CTTGGGCTGC TGCTGGAATT 
GCGAGCGCAT GCTAAGCTGA AATCCCTTTA 
AACGACGCCC CTGTGCGCCG GAATGAAACT 
GTCCTTGCAT GCTAAATTAG TTCTTGCAAT 
GCAGTCGCTG AGATTCTTTG GCCGTCTGTC 
AAATGCGCCC ATTCTGCCGG CTTGGATATG 
CGTGGTCTCC CCAGGCTGCG TGCTGTGCCG 
ACCTCCACCT CACCCCCTAA ATCCCGGGGG 
CCCCTCTTCC CTGGCGGGGA GAAAGGCTGC 
GGACTACAGG GGCAACTCCG CCGCAGGGCA 
TCTGCCCCTC GCTGCTCCCG GCGTTTGGCG 
GCCCCCCTCC CGCTCCCATT CTCTGCCGGG 
ACACGGACTA CAGGGGAGTT TTGTTGAAGT 
CGGCGCAGTA GCAGCGAGCA GCAGAGTCCG 
GGGAGCGCGG GGCAGCAGAA GCGAGAGCCG 
CCCCAGCTGC CCAGGAAGAG CCCCA 



Fig. 4 



10 20 30 40 50 60 70 

GAATTCACTG GGGAGAGCAT TCAGGAAGAT GACAACAGGA TAATAGGTCA ACAGAGTAAT AGAGAGGTCG 
CTTAAGTGAC CCCTCTCGTA AGTCCTTCTA CTGTTGTCCT ATTATCCAGT TGTCTCATTA TCTCTCCAGC 

80 90 100 110 120 130 140 

CTAAAAATAA ACTCTAAGAA GTATTCAGCC AAAACTATTA TTGAGCTAAT AATGGTGGGA TCAATTTCAG 
GATTTTTATT TGAGATTCTT CATAAGTCGG TTTTGATAAT AACTCGATTA TTACCACCCT AGTTAAAGTC 

150 160 170 180 190 200 210 

GGGAATATTG TGGGCAGAAG TCAGACTGTA GGAGGCTGGG GATCAAGAAG TTGAGGCAAG GAGGTTGGAC 
CCCTTATAAC ACCCGTCTTC AGTCTGACAT CCTCCGACCC CTAGTTCTTC AACTCCGTTC CTCCAACCTG 

220 230 240 250 260 270 280 

AACAACTGTT TTTTCAAGTT GGTCACGTGA ACAAATCTGT GACCTTCAGC CTCCCCTCCC TCGGGTCTTG 
TTGTTGACAA AAAAGTTCAA CCAGTGCACT TGTTTAGACA CTGGAAGTCG GAGGGGAGGG AGCCCAGAAC 

290 300 310 320 330 340 " 350 

GCTGAGCTGA TTGCAGGGCC CCTGCAGCTC TGGCACTCTC AAGTTGTATA AAACTGACAG TGCAGAAGTC 
CGACTCGACT AACGTCCCGG GGACGTCGAG ACCGTGAGAG TTCAACATAT TTTGACTGTC ACGTCTTCAG 

360 370 380 390 400 410 420 

}|y CTTGAGCCCA TTTTGGCTCT CATGATAATT TTCCTTCAGT GGAACTAAGG TTACTTGTCT AAGAAC C AAA 

%i GAACTCGGGT AAAAC C GAGA GTACTATTAA AAGGAAGTCA CCTTGATTCC AATGAACAGA TTCTTGGTTT 

; a s 

430 440 450 460 470 480 490 

I'sJ GCCTCTGACT TGACTGATCA AAGTTCATCA CGTGCATCGA AGCCACCTAC TTGGCAGATG TAGTGAAAAG 

'f m l CGGAGACTGA ACTGACTAGT TTCAAGTAGT GCACGTAGCT TCGGTGGATG AACCGTCTAC ATCACTTTTC 

^ 500 510 520 530 540 550 560 

il3 CTACATAGAT CTGGGCCCAG GACAGGATGC TGGGGCGTGG GAGGGGAAGA AAGCAGGTGC TAACTATATA 

ijl GATGTATCTA GACCCGGGTC CTGTCCTACG ACCCCGCACC CTCCCCTTCT TTCGTCCACG ATTGATATAT 

rfl 570 580 590 600 610 620 630 

frl GATAGCATGC CTATCAGAGC AGTTTTTACG TTTCCTATTT GTCTCTCAAA ACAATTTTAT AGGAATCATC 

j^f CTATCGTACG GATAGTCTCG TCAAAAATGC AAAGGATAAA CAGAGAGTTT TGTTAAAATA TCCTTAGTAG 

640 650 660 670 680 690 700 

AAAGCAATTT TATCATGGTT TCTAGACCAG GTTTGGATGT GAGGTAGGGA TTTCCACAGC TGCTTTTAGT 
TTTCGTTAAA ATAGTACCAA AGATCTGGTC CAAACCTACA CTCCATCCCT AAAGGTGTCG ACGAAAATCA 

710 720 730 740 750 760 770 

T T G AAGG AAA TCTGATAAGA TGATGCAAAA GCCCTTCAGA AATGTGTAAT CCTACACACT TCAGTGATTC 
AACTTCCTTT AGACTATTCT ACTACGTTTT CGGGAAGTCT TTACACATTA GGATGTGTGA AGTCACTAAG 

780 790 800 810 820 830 840 

AATTCATTGT C AAAAC T T AA GGTGTTTTTA ATATTGTTAT TGTTCATTTG GTTTTTACCA AC AT GT AAGG 
TTAAGTAACA GTTTTGAATT CCACAAAAAT TATAACAATA ACAAGTAAAC CAAAAATGGT TGTACATTCC 

850 860 870 880 890 900 910 

AGTTGGCAAT TATTTGTTAA ACTCATGTCT TAGGCTAAAT AAATTCCAAA AAATTCAGGA TGAGAATTGT 
TCAACCGTTA ATAAACAATT TGAGTACAGA ATCCGATTTA TTTAAGGTTT TTTAAGTCCT ACTCTTAACA 
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920 930 940 950 960 970 980 

TTATTGCTTA ACGTGTGTCA AATTTCTTCC ATGCACATCT TTATTAGATC TTCACAGCAA CCTACAGGAT 
AATAACGAAT TGCACACAGT TTAAAGAAGG TACGTGTAGA AATAATCTAG AAGTGTCGTT GGATGTCCTA 

990 1000 1010 1020 1030 1040 1050 

AAGCAAGACA GGTGCAAGTG CCTCCTTTGG GTATGAGGAA ACTGAGGTCT AAAGAGATGA AGTGATTTGC 
TTCGTTCTGT CCACGTTCAC GGAGGAAACC CATACTCCTT TGACTCCAGA TTTCTCTACT TCACTAAACG 

1060 1070 1080 1090 1100 1110 1120 

CCAAGGCTCA TAGCAATTTA TTGGTAGAGC AAAGACTAGA ATTCTCTTAA CTGCAGCCTA TTTTCCCTAT 
GGTTCCGAGT ATCGTTAAAT AACCATCTCG TTTCTGATCT TAAGAGAATT GACGTCGGAT AAAAGGGATA 

1130 1140 1150 1160 1170 1180 1190 

TCTGAACTGT TACATCAGCA TCAACAATTA TCTAATGGAT TGGAACAGTG TACACAGGCA GCTTAGCTAC 
AGACTTGACA ATGTAGTCGT AGTTGTTAAT AGATTACCTA ACCTTGTCAC ATGTGTCCGT CGAATCGATG 

1200 1210 1220 * 1230 1240 1250 1260 

GTCAAGTCAC GATTTTTACT TTAACTTCAA TTCCAGAGTC TTGGCCTGAT TTCCCTCAAG ACCCTACTTA 
CAGTTCAGTG CTAAAAATGA AATTGAAGTT AAGGTCTCAG AACCGGACTA AAGGGAGTTC TGGGATGAAT 

1270 1280 1290 1300 1310 1320 1330 

TCTTTGGCTT TGGAAAATTT ATTTTTCTTG CATTATCTTT CCAGCTAAAT TTTATTTAAT AACCATCAGC 
AGAAACCGAA ACCTTTTAAA TAAAAAGAAC GTAATAGAAA GGTCGATTTA AAATAAATTA TTGGTAGTCG 

1340 1350 1360 1370 1380 1390 1400 

ATGCTTTTTT TGCTTTATGC CAT G TAG ACT TGACCTGAAA ACCTGCCAGG CTTTCATTGA GTTTAGTGAT 
TACGAAAAAA ACGAAATACG GTACATCTGA ACTGGACTTT TGGACGGTCC GAAAGTAACT CAAATCACTA 

1410 1420 1430 1440 1450 1460 1470 

TAAAGAAGTA AAGTTCTGAG AAGCAATTAG TTGATGGGAC ACCAGTCATA AAATCAATCC AAACTTTTGT 
ATTTCTTCAT TTCAAGACTC TTCGTTAATC AACTACCCTG TGGTCAGTAT TTTAGTTAGG T T T GAAAAC A 

1480 1490 1500 1510 1520 1530 1540 

TGACATGTGT TTCTTTCTCC ATATACCAGG TTCCCGCTTC GTATTAGTAA GATTGAAATT GAAATAAGTC 
ACTGTACACA AAGAAAGAGG TATATGGTCC AAGGGCGAAG CATAATCATT CTAACTTTAA CTTTATTCAG 

1550 1560 1570 1580 1590 1600 1610 

TATTGCTGGT GGATGAATTT GTCACTTTCC TTGAAACTGG TGAACCCAAA AAGTTAGACA GTGATAGGAA 
ATAACGACCA CCTACTTAAA CAGTGAAAGG AACTTTGACC ACTTGGGTTT TTCAATCTGT CACTATCCTT 

1620 1630 1640 1650 1660 1670 1680 

AATACTGCCA TTGTCTGTTA AGAAGTCTAT GACATTTCAA GGCAAGAATG AAT AT AT GGA AGAAGAAACT 
TTATGACGGT AACAGACAAT TCTTCAGATA CTGTAAAGTT CCGTTCTTAC TTATATACCT TCTTCTTTGA 

1690 1700 1710 1720 1730 1740 1750 

TGTTTCTTCT TTACTTACAA AAAGGAAAGC CTGGAAGTGA ATGATATGGG TAT AAT T AAA AAAAAAAAAA 
ACAAAGAAGA AATGAATGTT TTTCCTTTCG GACCTTCACT TACTATACCC ATATTAATTT TTTTTTTTTT 

1760 1770 1780 1790 1800 1810 1820 

AAAACAAAAA ACCTTTACGT AACGTTTTGC TGGGAGAGAA GACTACGAAG CACATTTTCC AGGAAGTGTG 
TTTTGTTTTT TGGAAATGCA TTGCAAAACG ACCCTCTCTT CTGATGCTTC GTGTAAAAGG TCCTTCACAC 
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1830 
GGCTGCAACG 
CCGACGTTGC 

1900 
CTCTGCCACC 
GAGACGGTGG 

1970 
CTTCTCCCCG 
GAAGAGGGGC 



1840 
ATTGTGCGCT 
TAACACGCGA 

1910 
TTCTCTGCCA 
AAGAGACGGT 

1980 
ATCTGCGGCC 
TAGACGCCGG 



1850 
CTTAACTAAT 
GAATTGATTA 

1920 
GAAGATACCA 
CTTCTATGGT 

1990 
ACTGGACTGC 
TGACCTGACG 



1860 
CCTGAGTAAG 
GGACTCATTC 

1930 
TTTCAACTTT 
AAAGTTGAAA 

2000 
CCATCAGCAT 
GGTAGTCGTA 



1870 
GTGGCCACTT 
CACCGGTGAA 

1940 
AACACAGCAT 
TTGTGTCGTA 

2010 
GAAAATTTTT 
CTTTTAAAAA 



1880 
TGACAGTCTT 
ACTGTCAGAA 

1950 
GATCGAAACA 
CTAGCTTTGT 

2020 
ATGTATTTAC 
TACATAAATG 



1890 
CTCATGCTGC 
GAGTACGACG 

1960 
TACAACCAAA 
ATGTTGGTTT 

2030 
TTACTGTTTT 
AATGACAAAA 



2040 
TCTTATCACC 
AGAATAGTGG 

2110 
TGAACCACAA 
ACTTGGTGTT 

2180 
GGATGATGGT 
CCTACTACCA 



2050 
CAGATGATTG 
GTCTACTAAC 

2120 
GCCTTTATTA 
CGGAAATAAT 

2190 
AGAAAC C AAA 
TCTTTGGTTT 



2060 
GGTCAGCACT 
CCAGTCGTGA 

2130 
ACTAAATTTG 
TGATTTAAAC 

2200 
TAGAAGAATG 
ATCTTCTTAC 



2070 
TTTTGCTGTG 
AAAACGACAC 

2140 
GGGTCCTTAC 
CCCAGGAATG 

2210 
GTCTTGTGGC 
CAGAACACCG 



2080 
TATCTTCATA 
ATAGAAGTAT 

2150 
TAATTCATAG 
ATTAAGTATC 

2220 
ATAATGTTTG 
TATTACAAAC 



2090 
GAAGGCTGGA 
CTTCCGACCT 

2160 
GTTGGTTCTA 
CAACCAAGAT 

2230 
TTCCCTAGTC 
AAGGGATCAG 



2100 
CAAGGTAAGA 
GTTCCATTCT 

2170 
CCCAAATGAT 
GGGTTTACTA 

2240 
AATGAACTCT 
TTACTTGAGA 



2250 
CATATTCTTG 
GTATAAGAAC 

2320 
CCATACCATC 
GGTATGGTAG 



2260 
TCTCTGGTTA 
AGAGACCAAT 

2330 
TCTGTTATCC 
AGACAATAGG 



2270 
GGATCTTGGG 
CCTAGAACCC 

2340 
TGGGGCAAGT 
ACCCCGTTCA 



2280 
ATCTGGAGTC 
TAGACCTCAG 

2350 
GCCTCAGTTT 
CGGAGTCAAA 



2290 
AGACTGCCTG 
TCTGACGGAC 

2360 
CCACATCTGA 
GGTGTAGACT 



2300 
GGCTCAAATC 
CCGAGTTTAG 

2370 
GAAATGGGGA 
CTTTACCCCT 



2310 
TTGGCTCTGC 
AACCGAGACG 

2380 
TGGTAGTGGT 
ACCATCACCA 



2390 

GTCCATTTCA TAGAT 
CAGGTAAAGT ATCTA 



GAGATGTATATAATTTTTTAGGATy^ATCTCAAGGTTATCTTTACTTTTTCTTA 
GGAAATTAACAATTTAATATTAAGAAACGGCTCGTTCTTACACGGTAGACTTA 
ATACCGTAAGAACGAGCCGTTTTCGTTCTTCAGAGAAAGATTTGACAAGATTA 
CCATTGGCATCCCCGTTTTATTTGGTGCCTTTCACAGAAAGGGTTGGTCTTAA 
TT 



Fig. 6 
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TCTAGAAAAT AATTCCCAAT ATTGAATCCC 
AAGATAAGTT GAATTTGGTC ATGAAGGAAG 
ACAGCAAGGT CCAAAGACAG TCAGGTCTTC 
AGATGTCTGA TGTGAACTAA AAAGATTGGT 
GCTGTAAAGA AAAATTGAAA CTTGATTGTG 
GGACATGAAT CTGCAGGCCA GTGTTTGCAG 
CACAGACATG TGTTGTTTGG CCTGCAGAGG 
GAAT T AGT AG ' C C AT CAT T T T CAAAAAT C AA 
TGTTCTTGAA AATAATGAAA CATCTGGAAC 
CAGTTGGAGC TGCGTAGTGA CTGCCCACTT 
CAGCTGCCAC CACTTCCCTT TATCTCTCTA 
CTTAAGAGAT ATTTGGGTTT GAAACCTCTG 
GACAGATGCT ATCAGAGTTG TCTTTTAAGA 
ATAGCCTGGA AGAGGGCAGC CAGGGGAGAA 
ATGAGATGAT GGCTTCAACA TTGAGGACAG 
ATAAGCATCT ATGCAAAGGA AATAGCAGAT 
AAAGTTTATT CATAAGCCCC TCTTTTCATC 
TTGTTCTTCA CTCTCCTTCC TGGATCATGA 
TCCTATTCAT C AGT GAAT TC CAAGTGCCTA 
ACAAATATTC GTTCAGTGCA TGAATTGGAG 
AAAGGTTTAT TACATACCTA AATATGCTGA 
CCAAAACTCC CAGTTTGGAA ACAGAATTTT 
TTCAAAAAAT AT T TAT T AAA CAATGACCAT 
ATGATGAACA AAAACCAACA TGATTCCTGC 
ACAGATTGTG AGAAGTCCTC CAACAAATAC 
GTGGGTGAGG GACTTGAGCT AGCCTAGGTG 
TGCATAGGCA TTAACTAGAT AAATAGAGAG 
AC AGAAT AT G CAAAGGCAGG AAGGATTATG 
GTGTAAGTGG ATCTCAGAAA ACAGGAGGAA 
TCATGTGGGG TATGGTTAAT GTTTTGGACT 
CAGGACT T AA ACGGGGAAAT AATATGACCA 
AATATATATT GAGAGTTAAT TATTGACAAA 
AGTATTTTCA AAAAATGGTG CCAAAACAAT 
TCTACAAAAC TTAATTCAAA AT GGAT C AC A 
CTTCTGGAAG GAAAACACAG ATGGGAATCT 
ATATCTGACA CCATAATCCG TAAGTTAACA 
GACTTCATCA GAATTTAAAA TGTTTGTGCT 
AACTACAGAT AAGATATTGT TGAAT CAT AT 
ATAACTCTAA ACCCCCATAA TAAATTACAA 
AAAT T T AC AG TCTTCATCAA AGAAAGTATC 
TGCATCTTTA TTCATGGGGG GAT GAAAT AA 
AATACTAAAA TTAAAAAGAC TGACCATACC 
TACAATCAAC TTAGGTAGAT GATTTGGAAG 
TGTGACTCAC CCATTCCATG GCTAAGTATT 
AC AAA GAT GT T TAT AC AAAT ATTTATAGCA 
GAACCCAAAT GTC CATC AAA AGT GAAT GGA 
AATACTACTT AGCAATAAAG AAGAATGAGC 
TCAAAGGCAT TGTGCTCAGT GAAAGATGCA 
AGTTCCATTG ACATAAAACT CTGGAAAGTG 
CATTGGTTGC CTGAGGAGGA GGAGTATAGG 
CAATAAAAAC TTTTGGAATC ATAGATATAT 



AAAGAATTCA 


ACATTTGGGC 


TGTCGTTTGA 


61 


AGAGGGGGGA 


TACAATTTCA 


GTAAAAGGTA 


121 


AGTAGTATGG 


AGTATATTCA 


GAGGGAGCCA 


181 


GGTTGGTAGG 


AGGAAGAGGT 


G T GAGAA GAG 


241 


ATGGACTTTA 


AAGGCTAGGC 


TATGGGACTT 


301 


ACTGGCGCCC 


ATAACTGTCT 


ATCACAGCAA 


361 


TTTGGCCTGC 


AT GAT GAT TT 


TAAACCATCT 


421 


GAGATGCCAC 


ATTAAAATAT 


GGAATGCTGC 


481 


ATTGAGGCCA 


CATTCCTGAC 


TGACAGCAAT 


541 


TACATGGGGC 


ATCTGATCCC 


TAGTCGATTA 


601 


ATACCAAGCT 


CTTTTCACTC 


ATTTTTGTTA 


661 


ATGCAGGTAA 


TTGAGGGTTA 


TAGAGCAGAG 


721 


AAGAACCCTC 


TGTTCTTCAT 


TTTGTTGAAG 


781 


GTTAGGGCTG 


GAGCTATGAG 


AAAGCATAAG 


841 


AAAGAATATT 


GAGAT GAGAA 


AGTAGTCCAT 


901 


GTCCTCAAAT 


CAGCAGAGGC 


AACAACTCTG 


961 


TCCAATCCAG 


T T C AAAT GT A 


ATTATTTAAA 


1021 


ATGAGCTCCT 


TAAATGCAGG 


GTCCACAGTG 


1081 


GCACAGAGCC 


TGGCAAATAG 


TAAATGCTTA 


1141 


TGATTCTCTA 


CTTTGCCTCA 


TAAGTTGAAA 


1201 


AATCACAGGG 


CATTTGGCAA 


CCCCCCAAAA 


1261 


AATTCTGTGA 


AAATAAAATC 


CATTCATTTA 


1321 


GTCCACACCA 


GGCTGAGTCC 


TAAGGATTCA 


1381 


TCTTAGGAAA 


CATACAGTTC 


AGT GAGGAAA 


1441 


TGGGTGCTAT 


TAAAATATAT 


TAAAAGGTGA 


1501 


GTTCAGGAAG 


TCTTCCTGGA 


TGTGCTGATA 


1561 


AAGGAT GAAC 


CAACATTGCA 


GGTAGAGGGA 


1621 


GAGTCGTTGG 


AGGACCTGAA 


TAAAGGCCCA 


1681 


AGGTGTATGA 


GATGAGATCA 


GAGAGGCAGA 


1741 


TTTCTATTAA 


GAGCAATGGG 


GAGACAGTGA 


1801 


GATTAAACTT 


TCTAAAAAAC 


CCTCTATGCA 


1861 


GATTCAAAGG 


CAACAAAGTG 


GAGAGAGAAT 


1921 


AGGACATCTA 


TATTAAAAGT ■ 


TGGGTATCTG 


1981 


GACCTAAATG 


TAAAACTGAA 


AGC TAT AC AA 


2041 


GTGTGATCTT 


GAGTTTGAAA 


AT GAT T TAT T 


2101 


TAATTCATAA 


GTGAACAAAG 


T GAT GAAC TG 


2161 


T C AAAAGAC A 


CTGGTATGAT 


AAT GAAGAC A 


2221 


TTCTGATAAA 


GGAATTGTGG 


CTC AGAAT AC 


2281 


GTAGCCCAAT 


TAAAAAAAAA 


AAAAGAGAAA 


2341 


AATTGTAAAA 


TAAGCACATG 


AAAAATGCTC 


2401 


AAATTAAATG 


GGAAAGACAC 


CTCTAATTAG 


2461 


AAGTATTGGT 


GAAGT GGAAA 


TGTAAAATGA 


2521 


TTTCTTACAA 


AAGTAGGTGT 


ATACCTACCC 


2581 


TACCTGAGAG 


AAAT GAAAGA 


ATACATCCAT 


2641 


GTTTTATTTG 


TAGTAGCCCC 


AAACTGAAAA 


2701 


TAAACAAAGC 


GTGGTACAGC 


AAT GCAAT AG 


2761 


T AGT GAT AT A 


CATAACAGCT 


TAAATGTACA 


2821 


AGTAAAAAAA 


AAAAAGAGTA 


CATGCTGTAT 


2881 


AAAAACAGTC 


TATACTGACA 


GAAAGCAGAT 


2941 


AGAGGTGGAG 


GGAAAAT GTA 


CAAAGTGGCA 


3001 


TCACTATCTT 


GATTGAGTGA 


TGATTTCATG 


3061 
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AGTGCACGTG CGTGTGTCAA AAATGATCAA 
TGTATATATC AATTATACCT CAGTACGGCT 
CAGAACTGAT TCTAGGAAAG AGTGGAGGGA 
GAGAACGGTG CCTTCTGGAA GTGAACTAGG 
AGATGTGTGA GATACTTAGG AAATAAAACC 
ATCCTGGCTT TCCAGAGCAA AGAGCTAGGG 
TGCAGAGGAA GCTTATAGCA GAAACTTACT 
TGGTTCCATT T GAT TACT AA AAAACTGCAA 
GTACAAACAG TGAACTTATA TAATATTAAT 
GGGAAGAAAA TGAGATTTGA GCTGGAAGAT 
CCCTAATATG CTTATAGTAG. TTAACCTTTC 
TCTTTAAAAT AAAATAACAT AGTAATCCTT 
AGTTTGTTCT ATACACATTA TGTCTTTTAA 
TATTGGCAAC AGTTTTCAGA TGTGGAAACT 
CATAAACTTT GGAGTCTTTC AGACCCAGGT 
AACTTCAGAG CATTATTTCT CACCTTAATT 
CTCTTCACTC CTGTCCATTG TTTCTTCAGT 
TGTAGACACA TAGTTCTCCT GGCTCTGAGA 
CACGGCCCAC TCAATTCCTA TTGAGCAATG 
TGCAATGAGT ATTCAAATGA GGTTGAAGTA 
ATATTCAAGG AAGCTGTAGC CAT T AAAAT C 
AAAGGGTCTT CAGGCCTTTG TGTTCTGGAA 
ATGCGATTGA TGAGTCTAGA GTCTAGCTCA 
TTTTACTAAT GTGGAACATA GAACAAATTA 
ATAGAGCCTA TCATATCCAG TCTTCCTTGC 
GGTAAGTAAA TGCCTATGAT GTTTGGCTGT 
CCCATCTTTT CTCTGTAGAA GACTCAAAAA 
ACTTTTGGAA CAGAGAGCCT CCCCAGAATC 
CCCAGGTCTC TACTGACATA AAGGAAAGAT 
CTCTCTCTTG ATCCCATTCC CACTTGCATC 
CCTCCACAAA AGAGCAGTTG CAGTAACCCA 
AGAGGCGAAA AGCAGCCCTA TATAGGTTCT 
AAATATGCGA AGGCAACTTG GAAAAGCAAG 
CTCTGGACAA AATTGAGCGC CTATGTGTAC 
ACCTGCTTGT CTGGGTGATT TTGCCTTTGA 
GAACTTCCAG ACAGGAAGAA GGCAGAGAAG 
GAGGGTTCCT AGAGCAAATG GCACAATGCC 
CTAAGGTTTC AGCAAGTATC TGCTGGCTTG 
CTCTCCCACT CTCCCATCTG CGCGCTCTTA 
GAGCAGGTAT TCCTATCGTC CTTTTCCTCC 
ATTGGGCTTT GGAACCAAAT TTCCTGAGTG 
CCTAAACCTT GGTTTAGGAA ACCAGGAGCT 
AGCTAGCCTC TCCTGCCCTC GCCCACGCTG 
GCAGGCGTTA GCGCGCGGTG AGGGGAGGGG 
AGGTGGGAAG GCAAGGAGGC CGGCCCGGTG 
TTTGCTCTCC ACCTCCCAGC GCCCCCTCCG 
CGGGACGGTC CGGAGCAAGC CCACAGGCAG 
TAGCCGCTCC AGTGCTGTAC AGGAGCCGAA 
CCAGCGACAG CCAACGCCTC TTGCAGCGCG 
CTTTCCTCTT CGGTGAAGTT TTTAAAAGCT 



TTTATGCAAC TTTAAATATG TGCAGTTTAT 3121 
ATTAAAAAGA AACCCTCTGG CTGCACAATG 3181 
GGATGACCAT TTACAGTGCT CCAGGTGGAA 3241 
TTGGCAACAA CAGAGATGAA ATAAATGGGC 3301 
CGATGGTCAC CATTTTCCAA AGGTCAGCTC 33 61 
AAGACTTTAT TAATAAATCC CTCTTGAAGT 34 21 
CTCAACCTGA CTAATCTGAG AGAACACCTC 34 81 
AGAACAGGAG GAGAAAGAAG AAGAAAGCTG 3541 
CAATAATTGT CTCTTGTTCT TAAAAGCAAT 3 601 
CAGAGTTCAA AATCCAAATA AAGTATATGG 3 661 
CTGATAATGA TATAATTGTT GACAGCACCA 37 21 
CAGATTTGTA GAAGATCTTT CCTGTTTACA 37 81 
ATGACACACT AGCCTTCTGA GGGTAACTTA 38 41 
GTGAAGACAA TGTTGGTGAT GTGGAAGCAA 3 901 
TTGAATGTCA GACTGCTTTT TATTCAGAGT 3 9 61 
TTTTTTCAGG CCTCTTTGTG TCTATGTGTC 4021 
GATTTTTGCC ACCTTCCTTC ACTGTTAGTG 4081 ' 
GCCTATGTTA ATTCCATTCT ACCATCCTGC 4141 
CTAGTTGAAA GTTGTGGTGG GATTAAATGT 4 2 01 
TCTACGCATT CTACTTACAT ATGGTGAGGT 4 2 61 
TCAGGAAATA ATTTTTCACC TCCTCAGGTG 4 321 
GGTTCATTTA TAGCCATTTC CCAAATGACA 4 381 
AATAGCAATG GACTGGAAGA CTAGTTTAGG 4 441 
TGTCCTTGTT TCAGCCTGTT CATCTGTGAA 4 501 
CTTTAGGTTT GAGTTACCTT CTTTGGTCAA 4 5 61 
GCACAAGATA AAGCTACAAC AAAGCTACAA 4 621 
GCAAAAGAGA CCCAGGAAAA TCTCGGAATG 4 681 
AGAAGTCAAG GAATT TAAAC ATAGGGAAGG 4 741 
GTTTTCTTAT AGGTTTCACG TTTACATTTT 4 8 01 
TGCCACCTTT ACACAGGGCT TATGGGACCT 4 8 61 
CATCATCCTC TACGCCCTGG CTGTCCATCA 4 921 
ATCCTTGGAT AGTTCCAGTT GTAAAGTTTA 4 981 
CGGCTGCATA CAAAGCAAAC GTTTACAGAG 5041 
ATGGCAAGTG TTTTTAGTGT TTGTGTGTTT 5101 
GAGTCTGGAG AGTAGAAGTA CTGGTTAAAG 5161 
AGGGTAGAAA TGACTCTGAT TCTTGGGGCT 5221 
ACGAGGCCCG ATCTATCCCT ATGACGGAAT 52 81 
GTCATGGCTT GCTCCTCAGT TTGTAGGAGA 5341 
TCAGTCCTGA AAAGAACCCC TGGCAGCCAG 5 4 01 
CTCCCTCGCC CCACCCTGTT GGTTTTTTAG 54 61 
CTGGCCTCCA GGAAATCTGG AGCCCTGGCG 5521 
ATTCAGGAAG CAGGGGTCCT CCAGGGCTAG 5581 
CGCCAGCACT TGTTTCTCCA AAGCCACTAG 5 641 
AGAAAAGGAA AGGGGAGGGG AGGGAAAAGG 57 01 
GGGGCGGGAC CCGACTCGCA AACTGTTGCA 57 61 
AGATCCCGGG GAGCCAGCTT GCTGGGAGAG 5821 
AGGAGGCGAC AGAGGGAAAA AGGGCCGAGC 5881 
GGGACGCACC ACGCCAGCCC CAGCCCGGCT 5 941 
GCGGCTTCGA AGCCGCCGCC CGGAGCTGCC 6001 
GCTAAAGACT CGGAGGAAGC AAGGAAAGTG 60 61 



Fxq 1 <* SB 



CCTGGTAGGA CTGACGGCTG CCTTTGTCCT CCTCCTCTCC ACCCCGCCTC 

GCCTTCCCCC CCTCCCCCGT CTTCTCTCCC GCAGCTGCCT CAGTCGGCTA 

ACCCCCCTCA CCACCCTTCT CCCCACCCGC CCCCCCGCCC CCGTCGCCCA 

CCCGAGTTTG CAGAGAGGTA ACTCCCTTTG GCTGCGAGCG GGCGAGCTAG 

CAAAGAAGGC TCTTAGGAGC CAGGCGACTG GGGAGCGGCT TCAGCACTGC 

CGCCTGGTTA GGCTGCACGC GGAGAGAACC CTCTGTTTTC CCCCACTCTC 

CTCCTGCCTT CCCCACCCCG AGTGCGGAGC CAGAGATCAA AAGAT GAAAA 

TCTTCAGTAG CCAAAAAACA AAACAAACAA AAACAAAAAA CAAGAAATAA 

TAATAACTCA GTTCTTATTT GCACCTACTT CAGTGGACAC TGAATTTGGA 

TTTTGTTTTT TTCTTTTAAG ATCTGGGCAT CTTTTGAATC TACCCTTCAA 

ACAGACTGTG AGCCTAGCAG GGCAGATCTT GTCCACCGTG TGTCTTCTTC 

TTTGAGGCTG TCAGAGCGCT TTTTGCGTGG TTGCTCCCGC AAGTTTCCTT 

TCCCGCAGGT GGGCAGCTAG CTGCAGCGAC TACCGCATCA TCACAGCCTG 

CTGAGCAAGA GAAGGGGAGG CGGGGTAAGG GAAGTAGGTG G AAGAT T C AG 
GGATG 



CCCCCACCCT 


6 


121 


CTCTCAGCCA 


6 


181 - 


GCGCTGCCAG 


6241 


CTGCACATTG 


6 


301 


AGCCACGACC 


6 


361 


TCTCCACCTC 


6 


421 


GGCAGTCAGG 


6 


481 


AAGAAAAAGA 


6 


541 


AGGTGGAGGA 


6 


601 


GTATTAAGAG 


6 


661 


TGCACGAGAC 


6721 


CTCTGGAGCT 


6 


781 


TTGAACTCTT 


6 


841 


CCAAGCTCAA 


6 


901 




CA GGCCCCACAA 

TGGGACCATT TATTGAGCAA TTATTATGTG 

TTATCTCCTT TAACCCCAGC ATAGTATGTC 
ACCAGAGATG CTCAGTCACT TGCCCAAGGT 

TTTTTTTTTT TTTTTTGACA CGGAGTCTCA 

CAATCTCAGC TCACTGCAAG CTCTGCCTCC 

CTGAGTAGCT GGGACTACAG GCACCCGCCA 

CAGAGACAGG GTTTCACCGT GTTAGCCAGG 

CCTGCCTCGG CCTCCCAAAG TGATGGAATT 

TCAAGATTTG AACCCAGGTC CTCTTGGTCC 

TGCATACGCA CCTGTCCCTC TAGAGGTGCC 

GTTAGAAATG CTAGGTATAC CTGTAAAGAG 

AAGAGATGCT GGTGTCCTTC ATTCTCCAGT 

CCAGTATAGC TGCATTCTTG GCTGGGGCAT 

AAAATAAGGA GGCCCAGTTA AATTTGAATT 

AAATATGTCC CATGCAATAT CTTGTTGAAA 

TCCCCACCCC TACCACTAGG CCTAAGGAAT 

GAGAAGTGGA ATTAAGCAGG CTAATAGAAG 

TTGGGTGCTG GGTGCCTCCT TAAATAAGCA 

AAGTCTTTTT GGGCTGGGTG CAGTTGCTCG 

AGGCGGGAGG ATCACCTGAG CTTGGGAGTT 

CTGTCTTTAC TAAAAATACA AAAAAT T C AG 

AGCTGCTCGG GAGGCTGAGG CAGGAGAAT C 

AGCAGAGATC GCGCCATTGC TCTCCAGCCT 

AAAAAAAAAG TCCTTTCGAT GTGACTGTCT 

ATGCTTTTCA GATACTTCAA AGATTCCAGA 

AGGTAAACAC AACACATCCC CCTCCTTGAC 

AACCATTATT TGATATTAAA ACAATAGGCT 

AAGTTAGATA ACTGAGACTT AAAGGGTGTT 

GACTCCGGGG GAGGGGGCAG AGTCACCAGC 

TGTGAGAACG GCTGCAGGCA ACCCAGGCGT 

CGCGAAGAGA GGGAGAAAGT GAAGCTGGGA 

AGTTGGAGGG GGCGAGCTGG GAGCGCGCTT 

AGGAGGAGGG CTGCTTGAGG AAGTATAAGA 

TTGGGACCGG AGAAACCAGG GGAGCCCCCC 

CTTTACTGCG CCGCGCGCCC GGCCCCCACC 

CAGCCGGGTC CAGCCGGAGC CATGG 



AACCTAGATC TGCCCCAGTA TAACTAAATC 1501 

CCAAGTATTG CGCTGAGTGC TTCCAGAGCA 1561 

AGATGCTGTT TTACAGATGA GCCAACTGAG 1621 

GACATGACTG ATATGGAATA GAGTCAAGAT 1681 

CTCTGTCTCC CAGGCTGGAG TGCAGAGGCG 17 41 

CAGGTTCACG CATTCTCCTG CCTCAGCCTC 1801 

CCACACCTGG CTAATTTTTT GTATTTTTAG 18 61 
ATGGTCTCGA TCTCCTGACC TCGTGATCTG 1921 

ACAGGTGTGA GCCACCGCGA CTGGCCAGAT 1981 

CAGAGGCCCC TGTTTCTCAA CTCCCTAGCA 2 041* 

TGCTTAAGTG TGCTCAGCAC AT GGAAGC AA 2101 

GTGTGGGAGA TGGGGGGGAG GGAAGAGAGA 2161 

CCCTGATAGG TGCCTTTGAT CCCTTCTTGA 2221 

TCCAACTAGA ACTGCCAAAT TTAGCACATA 2 281 

TCAGATAAAC AATGAATAAT TTGTTAGTAT 2341 

TTAAAAAAAA AAAAAAAAGT CTTCCTTCCA 2 4 01 

AGGGTCAGGG GCTCCAAATA GAATGTGGTT 2 4 61 

GCAAGGGGCA AAGAAGAAAC CTTGAATGCA 2521 

AGAAGGGTGC ATTTTGAAGA ATTGAGATAG 2581 

TGGTTGTAAT TCCAGCACTT TGGGAGGCTG 2 641 

CAAGACCAGC CTCACCAACG TGGAGAAACC 2 701 

CTGGTCATGG TGGCACATGC CTGTAATCCC 27 61 

ACTTGAACCA GGGAGGCAGA GGTTGTGGTG 2 821 

GGGCAACAAG AGCAAAAGTT CGTTTAAAAA 28 81 

CCTCCCAAAT TTGTAGACCC TCTTAAGATC 2 941 

AGATATGCCC CGGGGGTCCT GGAAGC CACA 3001 

TATCAATTTT ACTAGAGGAT GTGGTGGGAA 30 61 

TGGGATGGAG TAGGATGCAA GCTCCCCAGG 3121 

AAGAGTGGCA GCCTAGGGAA ATTTATCCCG 3181 

CTCTGCATTT AGGGATTCTC CGAGGAAAAG 32 41 

CCCGGCGCTA GGAGG GACGA CCCAGGCCTG 3301 

GTTGCCGACT CCCAGACTTC GTTGGAATGC 33 61 

GCTCCCAATC ACCGGAGAAG GAGGAGGTGG 3421 

ATGAAGTTGT GAAGCTGAGA TTCCCCTCCA 34 81 

GGGCAGCCGC GCGCCCCTTC CCACGGGGCC 3541 

CCTCGCAGCA CCCCGCGCCC CGCGCCCTCC 3 601 
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